Express Mail No. EV 686 490 129 US 



Attorney Docket No. 


RIGL-008CIP 


Confirmation No. 


6653 


First Named Inventor 


Ying Luo 


Application Number 


09/715,725 


Filing Date 


November 16, 2000 


Group Art Unit 


1642 


Examiner Name 


Susan Ungar 


Title: NOVEL lAPS ASSOCIATED CELL CYCLE 
PROTEINS, COMPOSITIONS AND METHODS OF 
USE 



APPELLANTS' BRIEF 

Mail Stop Appeal Brief-Patents 
Commissioner for Patents 

P.O. Box 1450 
Alexandria, VA22313-1450 



Sir: 



This Brief is filed in support of Appellants' appeal or the rejections set forth in the 
Office Action dated May 5, 2006, 2006. A Notice of Appeal was filed on February 10, 2006. 



The Commissioner is hereby authorized to charge deposit account number 50-0815 to 
cover any fee required for filing this brief 




Atty Dkt. No.: RIGL-008CIP 
USSN: 09/715,725 



Table of Contents 



Contents Page 

Real Party in Interest 3 

Related Appeals and Interferences 3 

Status of Claims 3 

Status of Amendments 3 

Summary of Claimed Subject Matter 3 

Grounds of Rejection to be Reviewed on Appeal 5 

Arguments 6 

Summary 16 

Claims Appendix 18 

Evidence Appendix 19 

Related Proceedings Appendix 20 



2 



Atty Dkt. No.: RIGL-008CIP 
USSN: 09/715,725 

Real Party in Interest 

The real party in interest in this case is Rigel Pharmaceuticals, Inc. 

Related Appeals and Interferences 

There are no appeals pending which would directly affect, be directly affected by, or 
have a bearing on the Board's decision in the instant appeal. 

Status of Claims 

Claims 26, 27, 29, 30 and 32 are pending. Rejections of claims 26, 27, 29, 30 and 32 
are appealed herein. 

Claims 1-25 and 31 are cancelled. 

Status of Amendments 

No amendments to the claims were filed subsequent to issuance of the prior Office 

Action. 

Summary of Claimed Subject Matter 

The pending claims are directed to a recombinant ING2 protein. Wild-type ING2 
protein has a high level of sequence identity to INGl (Fig. 1 1 and page 6, line 20), a protein 
that has been well characterized structurally and functionally (page 7, lines 12-24). The 
instant specification shows experimental results demonstrating that ING2, like INGl, 
cooperates with the well known tumor suppressor p53 in activating transcription of p53- 
inducible promoters (Fig. 12; page 7, lines 8-10). The claims encompass ING2 proteins that 
are encoded by a nucleotide sequence of SEQ ID N0:7 (see, e.g., page 5, lines 15-16 and Fig. 
7) as well as ING2 proteins having at least 95% sequence identity to the amino acid sequence 
of SEQ ID N0:8 (see, e.g., the polypeptide encoded by SEQ ID N0:7; page 5, lines 17-18 
and Fig. 8). 

Claim 26 recites a recombinant ING2 protein encoded by the contiguous 
polynucleotide sequence of nucleotides 120-845 of the nucleic acid set forth in SEQ ED N0:7 
(Fig. 7). 



3 



Atty Dkt. No.: RIGL-008CIP 
USSN: 09/715,725 



Claim 27 recites a recombinant ING2 protein comprising an amino acid sequence 
having at least 95% sequence identity to the contiguous sequence set forth in SEQ ID N0:8 
(page 7, lines 31-35), where the recombinant ING2 protein increases activity of a promoter 
having a p53 binding site v^hen introduced into a mammalian cell (page 7, lines 8-10 and 
page 33, lines 5-6). 

Claim 29 recites a recombinant ING2 protein consisting essentially of the contiguous 
amino acid sequence set forth SEQ ID N0:8 (Fig. 8). 

Claim 30 is dependent on claim 29 and recites a recombinant ING2 protein that 
increases activity of a promoter having a p53 binding site when introduced into a mammalian 
cell (page 7, lines 8-10 and page 33, lines 5-6). 

Claim 32 is dependent on claim 27 and recites a recombinant ING2 protein that has 
the contiguous amino acid sequence set forth in SEQ ID NO: 8 (Fig. 8). 
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Grounds of Rejection to be Reviewed on Appeal 

1. Rejection of claims 26-27, 29-30 and 32 under 35 U.S.C. § 112. HI renablement)^ 
If the Appellant's understanding of this eight page rejection is correct, the Examiner 

argues that the specification does not enable one of skill in the art to use the invention 
because the asserted activity of the claimed ING2 polypeptide is questionable. 

2. Rejection of claims 27, 30 and 32 under 35 U.S.C. § 112, 111 (written description)^ 

In attempting to establish this rejection, the Examiner argues that an ING2 protein that 
"increases activity of a promoter having a p53 binding site" is not described in the 
specification as filed. This is a new matter rejection. 

3. Rejection of claim 27 under 35 U.S.C. § 1 12, fist paragraph (written description)^ 
Citing Lilly^ and Enzo^, the Examiner argues that a genus of ING2 proteins that 

comprise an amino acid sequence having at least 95% identity to SEQ ID NO:8 and that 
increase activity of a promoter having a p53 binding sites are inadequately described by the 
instant specification. 

4. Rejection of claim 27 under 35 U.S.C. § 1 12, fist paragraph (enablement)^ 

In a nutshell, the Examiner attempts to establish this rejection by arguing that one of 
skill in the art would not know which amino acids of SEQ ID NO: 8 could be changed without 
changing the fimction of the protein, and thus he could not practice the full scope of the 
claims without undue experimentation. 

^ See Office Action mailed August 10, 2005, at pages 2-10, item 3. 
^ See Office Action mailed August 10, 2005, at pages 10-12, item 4. 
^ See Office Action mailed August 10, 2005, at pages 12-17, item 5. 

University of California v. Eli Lilly and Co., 119 F.3d 1559. 43 USPQ2d 1398 (Fed. Cir. 1997) 
^ Enzo Biochem, Inc, v, Gen-Probe Inc, 269 F.3d 1316. 63 USPQ2d 1609 (Fed. Cir. 2002) 
® See Office Action mailed August 10, 2005, at pages 17-19, item 8. 
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Arguments 

L Rejection of claims 26-27. 29-30 and 32 under 35 U.S.C. § 112. first paragraph 
(enablement) 

Claims 26-27, 29-30 and 32 are rejected as failing to meet the enablement requirement 
of 35 U.S.C. § 112, first paragraph. Summarizing an eight page rejection, the Examiner 
argues that the claimed ING2 polypeptides have questionable utility and, as such, the 
specification does not enable one of skill in the art to use the claimed subject matter. This 
enablement rejection is grounded on a "lack of utility" basis. 

MPEP § 2 164. 071. A very clearly states, however, that if there is no basis for rejecting 
claims under 35 U.S.C. § 101, there is also no basis for an enablement rejection grounded on 
a "lack of utility" basis.^ In other words, according to MPEP § 2164.07I.A, this rejection 
should not be imposed unless it is also accompanied by a rejection under 35 U.S.C. § 101, 

Claims 26-27, 29-30 and 32 are not rejected under 35 U.S.C. § 101. There is no 
question about the utility of the subject matter of the claims. 

Since there is no accompanying rejection under 35 U.S.C. § 101, in accordance with 
MPEP § 2 164. 071. A, this rejection lacks merit and should be reversed. 

In attempting to establish this rejection, the Examiner asserts there is no nexus 
between SEQ ID NO: 8 and either ING2b or ING2c (data is provided for ING2b (SEQ ID 
N0:4) and ING2c (SEQ ID N0:6) showing activation of p53, a well known tumor 
suppressor), and thus SEQ ID NO: 8 has no use.^ The grounds for this enablement rejection 
are substantially identical to the grounds to support the prior utility rejection under §101,^ 
which was withdrawn in view of Applicants ' arguments earlier in prosecution. 

The Examiner's maintenance of a rejection under 35 U.S.C. §112, first paragraph, on 
the same grounds that were the basis of a rejection under 35 U.S.C. § 101 both inconsistent 

^ MPEP § 2164.07I.A: "Office personnel should not impose a 35 U.S.C. 112, first paragraph, rejection 
grounded on a "lack of utility basis unless a 35 U.S.C. 101 rejection is proper." 

® Office Action mailed August 10, 2005, page 3, lines 10-11 and page 4, lines 21-23. 
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with the law, and the Patent Office requirement for Examiners to efficiently examine 
appHcations and avoid "piecemail" prosecution.^^ 

As detailed in prior responses, SEQ ID NOS: 2, 4, 6, 8 and 10, as well as p28ING5 
(see, Shiseki et al Cancer Research 63: 2373-2378, 2003) are spUce variants encoded by the 
same gene, and all share a 200 amino acid domain. As shown in Fig. 12 of the instant 
specification and in Shiseki et al, each of ING2b, ING2c and p28ING5 regulate expression 
from p53 binding site promoters. Li view of this data and the high relatedness of the members 
of the ING2 family, one of skill in the art would reasonably conclude that SEQ K) NO: 8 has 
uses that are enabled by the instant specification, e.g., in screening assays for identifying 
agents that modulate p53 activity and the cell cycle (see, e.g., page 35, line 32 to page 36, line 
19). 

The utility of SEQ ID NO: 8 is not in any doubt. Since this rejection is based on the 
Examiner's notion that the claimed subject matter has questionable utility, this rejection 
should be reversed. 

Reversal of this rejection under 35 U.S.C. § 112, first paragraph (enablement based on 
lack of utility) is respectfully requested. 

2. Rejection of claims 27. 30 and 32 under 35 U.S.C. $ 112, ^1 (written description) 

Claims 27, 30 and 32 are rejected as failing to meet the written description 
requirement of 35 U.S.C. § 1 12, first paragraph. The basis for this rejection is the Examiner's 
belief that the claims 27, 30 and 32 recite subject matter that is broader than as described in 
the specification as originally filed. 

The written description requirement of 35 U.S.C. § 112, first paragraph, involves the 
question of whether the subject matter of a claim conforms to the disclosure of an application 
as filed. According to the MPEP, an objective standard for determining compliance with the 
written description requirement is, "does the description clearly allow persons of ordinary 



^ See Office Action mailed August 8, 2003, pages 4-12. 

^° MPEP § 707.07(g): Piecemeal examination should be avoided as much as possible. 
See Office Action dated August 10, 2005, last paragraph. 
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skill in the art to recognize that he or she invented what is claimed?" The subject matter of 
the claim need not be described literally (i.e. using the same terms or in haec verba) in order 
for the disclosure to satisfy the written description requirement*^. 

According to page 1 1 of the Office Action, the Examiner appears to believe that the 
phrase "wherein said recombinant ING2 protein increases activity of a promoter having a p53 
binding site", is not supported in the specification as filed. 

The Appellants submit, however, that there is extensive and explicit support for ING2 
proteins that increase the activity of a promoter having a p53 binding site at several places in 
the specification. 

For example, support for such ING2 proteins can be found at: 

■ page 7, lines 8-9, stating that "ING2 activates p53 binding site controlled 
promoters in the presence of absence of p53"; 

■ page 33, lines 5-6, stating that cell cycle proteins (of which ING2 is an 
example) "activate p53 binding site controlled promoters"; 

■ page 37, line 3, stating that assay employing an ENG2 polypeptide can include 
measuring "activation of p53 binding site controlled promoters"; and 

■ Fig. 12, which shows data demonstrating that ING2 proteins (of which SEQ 
ID NO: 8 is an example) increases transcriptional activation by p53 in a 
mammahan cell. 

In view of the above, the Appellants submit there is adequate support for the phrase 
"wherein said recombinant ING2 protein increases activity of a promoter having a p53 
binding site". 



See MPEP § 2163.02, citing In re Gosteli 872 F.2d 1008, 1012 (Fed. Cir. 1989). 
^^See MPEP § 2163.02. 



8 



Atty Dkt. No.: RIGL-008CIP 
USSN: 09/715,725 



The Appellants submit that the current claims recite subject matter that is no broader 
than as described in the specification as originally filed. As such, the Appellants believe that 
the written description requirement of 35 U.S.C. § 112, first paragraph, has been satisfied, and 
no new matter is added. 

Reversal of this rejection is requested. 

3. Rejection of claim 27 under 35 U.S.C. § 112. first paragraph (written description) 

Claim 27 is rejected as not meeting the vmtten description requirement of 35 U.S.C. § 
112, first paragraph. In attempting to support this rejection, the Examiner argues that a genus 
of ING2 proteins that comprise an amino acid sequence having at least 95% identity to SEQ 
ID N0:8 and that increase activity of a promoter having a p53 binding sites are inadequately 
described by the instant specification. 

The Appellants submit that the fact pattem in the instant case falls squarely into the 
fact pattem of Example 14 of the Synopsis of AppUcation of Written Description 
Guidelines'^ ("the Guidelines"; copy enclosed as Exhibit A), 

The Appellants understand that he Patent Office has instructed Examiners to adhere to 
the law as summarized in the Guidelines. While certainly the Guidelines would be 
subservient to the law, the Examiner has cited no Board, Federal Circuit or Supreme Court 
decision that renders the Guidelines inapplicable. Indeed, the Federal Circuit itself has 
pointed to these very Guidelines in recent decisions in assisting the analysis of written 
description issues.'^ As such, the Guidelines should be followed, not ignored. 

The Appellants submit that if the Guidelines are followed, claim 27 meets the written 
description requirement of §112, first paragraph. 

Example 14 of the Guidelines describes a scenario very similar to that currently under 
examination. 



"Synopsis of Application of Written Description Guidelines", as published to the world wide website of the 
U.S.P.T.O. on March 1'*, 2000. 

See Enzo Biochem v Gen-Probe 296 F3d 1316 (Fed. Cir. 2002) 
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In Example 14, the claim is directed to "A protein having SEQ ID NO: 3 and variants 
thereof that are at least 95% identical to SEQ ID NO: 3 and catalyze the reaction of A->B." 
The language of the claim of Example 14 is almost identical to rejected claim 27. The 
Example 14 claim and claim 27 recite a specific sequence and a functional activity. The claim 
of Example 14 and claim 27 are therefore of similar in type and kind. 

The specification of Example 14 discloses a polypeptide sequence of SEQ ID NO: 3 as 
having a certain activity/^ The specification of Example 14 also "contemplates but does not 
exemplify" variants of SEQ ID N0:3, and provides an assay for measuring the activity of the 
protein. 

Again, the fact pattern of Example 14 is very similar to the instant fact pattern. The 
instant specification: a) describes the sequence of a full length polypeptide (i.e., SEQ ID 
N0:8), b) describes that SEQ ID N0:8 has lAP (for ""/nhibitor of ^poptosis Protein") 
binding activity (see page 33, lines 4-5) and p53-modulatory activity (see, e.g.. Fig. 12; page 
7, lines 8-10); and c) provides detailed methods of how lAP binding activity and p53 
modulatory activity can be assayed (see e.g., page 42, lines 16-27 and Fig. 12 of the instant 
specification). In addition, a total of five similar ING2 isoforms, ING2A, ING2B, ING2C, 
ING2D and ING2E (corresponding to SEQ ID NOS: 2, 4, 6, 8 and 10, respectively) are 
described in the instant specification. The amino acid sequences of these proteins are shown 
in the sequence alignment of Fig. 1 1 . As such, the Applicant's claims are supported by five 
examples, of ING2 proteins, not one. 

The Guidelines state that the claim of Example 14 is adequately described by the 
specification and the requirements of 35 USC §112 first paragraph have been met because: 

"The single species disclosed is representative of the genus because all 
members have at least 95% structural identity with the reference compound 
and because of the presence of an assay which applicant provided for 
identifying all of the at least 95% identical variants of SEQ ID NO: 3 which 
are capable of the specified catalytic activity." 

The claim of Example 14 and rejected claim 27 contain limitations that are similar in 
type and kind. The specification of Example 14 and the instant specification both describe a 



SEQ ID NO: 3 encodes a polypeptide having ligase activity 
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representative sequence, as well as a functional activity and an assay by which the activity can 
be measured. 

In view of the similarity between the fact pattern of the Example 14 and the fact 
pattern described in Example 14 of the Guidelines, the Appellants submit that claim 27, like 
the claim of Example 14, meets the written description requirement of §112,^1. 

The Patent Office Guidelines are provided to Examiners for good reason: to promote 
consistent application of the law to claims of similar subject matter. This not only serves the 
Patent Office's goal of efficient prosecution, but also provides applicants with guidance as to 
how they can reasonably expect the law to be applied. Here, the Examiner has elected to 
wholly ignore the Patent Office's own guidelines, resulting in protracted prosecution of the 
instant claims and increased cost to the Applicants. 

In rendering its decision, the Board is respectfully requested to clear that, to the extent 
the Guidelines are and remain good law, Examiners should apply them. 

The Applicants finally note that reversal of this rejection would be consistent with 
recent decisions by the Board of Patent Appeals and Interferences of the United States Patent 
and Trademark Office. The decisions are and Ex parte Bandman BAPI Appeal No. 2004- 
2319 (2004) and Ex parte Sun BAPI Appeal No. 2003-1993 (2003), among others. The genus 
claims that are the subject of in these decisions were supported by disclosure of ei single 
representative species encompassed by the claims. Since the instant claims are supported by 
five examples, the Applicants submit that the instant claims should well satisfy criteria used 
by the Board for withdrawing this type of rejection. 

Reversal of this rejection is respectfully requested. 

4. Rejection of claim 27 under 35 U.S.C. § 112. fist paragraph (enablement) 

Claim 27 is rejected as not meeting the enablement requirement of 35 U.S.C. §112, 
first paragraph. In attempting to support this rejection, the Examiner argues that a genus of 
ING2 proteins that comprise an amino acid sequence having at least 95% identity to SEQ ID 
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N0:8 and that increase activity of a promoter having a p53 binding sites are not enabled by 
the instant specification. 

In order to satisfy the enablement requirement of 35 U.S.C. § 1 12, first paragraph, the 
Office is reminded that extensive experimentation may be performed, as long as the 
experimentation is routine, and that every species within a genus does not have to be 
operative for a claim to be fiilly enabled/^ 

This rejection is based on the idea that biological systems are complex and 
unpredictable and, as such, one of skill in the art would not know which amino acids of the 
claimed ING2 protein could be changed without changing the fimction of the protein. 

The Appellants are aware that biological systems can, in certain circumstances, be 
complex and unpredictable. The Appellants submit, however, that the instant specification, 
combined with what is already known about related the related INGl, which is a highly 
characterized protein and has a fiinction similar to ING2, provides sufficient guidance as to 
which amino acids could be changed in ING2 in order for it to remain functional. As such, the 
Appellants believe that no undue experimentation would be required to practice what is being 
claimed. 

Factual support for the Appellants position is set forth below: 
With regard to ING2 protein domains that are important for ING2 function, a skilled 
person would look towards Fig. 1 1 of the specification, and what is known in the art about the 
structure and function of the highly related INGl protein, which is shown aligned with the 
subject ING2 isoforms in Fig. 1 1, for guidance. Fig. 1 1 of the instant application is attached 
hereto as Appendix B. 

Fig. 1 1 shows an alignment of the subject ING2 isoforms with other ING proteins, 
including INGl. INGl, as described on page 7 of the instant specification, is highly 
characterized and has a role in apoptosis that appears to be similar to that of ING2. Residues 
that are conserved between two or more proteins shown in this alignment as highlighted in 
black boxes. This alignment therefore shows several regions that are conserved between the 
various ING proteins, including, in particular, several conserved amino acids at the C- 

In re Certain Limited-Charge Cell Culture Microcarriers, 221 USPQ 1 165, 1 174 (Int'l Trade Comm'n 
1983), affd sub nom., Massachusetts Institute of Technology v, A.B. Fortia, 227 USPQ 428 (Fed. Cir. 
1985). Hybritech v. Monoclonal Antibodies, Inc. 231 USPQ 81 (Fed. Cir. 1986) 
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terminus of the protein, and a large region of amino acids at the C-terminus of the protein, 
starting from the "DPNEPTY. . and finishing at " TTKPKGKW". 

Since INGl and ING2 are functionally similar and contain several regions of 
conserved amino acids, including a large block of conserved amino acids at their C-termini, a 
skilled person would instantly recognize that these domains may be important for function of 
ING proteins. Moreover, the fact that these domains are conserved among the different ING2 
isoforms themselves further suggests to the skilled person that these domains are less 
preferred for introduction of amino acid changes, particularly those non-conservative amino 
acid changes. A skilled person would generally avoid making amino acid changes in these 
regions when designing EISfG2 variants that are at least 95% to SEQ JD N0:8. 

With the knowledge that INGl and ING2 have apoptosis activity and have conserved 
domains, a skilled person would, for example, swap domains from INGl to ING2 (i.e. SEQ 
ID NO: 8) with an expectation that the resultant protein would retain an apoptosis activity. 

Further, a skilled person, upon viewing the instant specification, particularly Figure 1 1 
would have knowledge of the prior art regarding the structure/function relationship of ING 
proteins, and would apply this knowledge in the production of claimed ING2 variants. For 
example, one of skill in the art would have knowledge of all of the papers referenced on page 
7, lines 22-24, of the instant specification, i.e., Helbing et aL, A novel candidate tumor 
suppressor, INGl, is involved in the regulation of apoptosis. Cancer Res. 1997 57:1255-8; 
Garkavtsev et al.. Suppression of the novel growth inhibitor pSSINGl promotes neoplastic 
transformation, Nat Genet. 1996 14:415-20; Garkavtsev et al., The candidate tumour 
suppressor pSSINGl cooperates with p53 in cell growth control Nature. 1998 Jan 
15;391(6664):295-8; and Shimada et al.. Cloning of a novel gene (INGIL) homologous to 
INGl, a candidate tumor suppressor Cytogenet Cell Genet. 1998 83:232-5. 

In particular, Zeremski (J. Bio. Chem. 1999 Structure and regulation of the mouse 
ingl gene. Three alternative transcripts encode two phd finger proteins that have opposite 
effects on p53 function 274:32172-32181; copy enclosed as Exhibit B), discusses several 
amino acids that are conserved between ING proteins (see Fig. 4B on page 32177 and 
abstract), and indicates that the conserved C-terminal domain is a "PHD" DNA binding 
domain (see page 32180, second column). Most of the conserved amino acids of Zeremski's 



13 



Atty Dkt. No.: RIGL-008CIP 
USSN: 09/715,725 



sequence alignment are also indicated as being conserved in the alignment of Fig. 11. While 
not relied upon to make this assertion, Zeremski confirms the conserved amino acids 
identified in Fig. 11, and assigns an art-recognized domain name and function to the 
conserved C-terminal region. 

A skilled person would also look towards the sequence alignment shovra in Fig. 1 1, as 
well what is known in the art about the structure of the INGl protein for guidance in making 
variants of ING2. 

Fig. 1 1 is reproduced below. 
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As shown in Fig. 11, ING2 proteins are highly related to INGl, a highly characterized 
protein having a function similar to SEQ E) N0:8.^^ When provided the sequence alignment 
Figure 1 1, the skilled person, who also has extensive knowledge of several publications on 

^® ING1 in the prior art, but is not within the scope of claim 27. 
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the structure/function relationship of INGl proteins, would note the conserved and non- 
conserved regions and amino acids between the various ING proteins in the alignment and 
readily use this knowledge to produce functional variants of SEQ ID NO: 8. For example, the 
skilled person would recognize a large number of amino acids in an ING2 protein having the 
sequence of SEQ ID NO: 8 that may be substituted, and reasonably expect that these 
substitutions would have no significant effect on its function. 

The bottom line of the ING sequences shown in Figure 1 1 shows a "consensus" 
sequence. This consensus sequence shows amino acids that are conserved in all ING proteins 
shown (indicated in upper case letters). Residues conserved in more than two ING proteins 
(but not all ING proteins) are shown in lower case letters. Since all the ING proteins shown in 
the figure have a conserved function, a skilled person would recognize that, in order to make 
ING2 variants, amino acids at positions that correspond to upper case letters in the consensus 
sequence should would be less preferred sites for modification, and that amino acids that 
correspond to lower case letters in the consensus sequence may be modified to any other 
amino acid at that position. 

Further, at positions in which there is no amino acid listed in the consensus sequence, 
a skilled person may be able to choose any amino acid at that position. A skilled person 
would also recognize that certain amino acids may be substituted by another amino acid 
having a similar property. For example, D (Asp) and E (Glu) are acidic resides and may be 
substituted by each other. Likewise, it is well knows that K (Lys) and R (Arg) are basic 
residues and may be substituted by each other, S (Ser) and T (Thr) are polar, uncharged, 
hydroxyl-containing residues and may be substituted by each other, and that L (Leu), I (He) 
and V (Val) non-polar residues that may be substituted by each other. 

In other words, by simply looking at the sequence alignment shown in Fig. 1 1, a 
skilled person would instantly recognize a large number of amino acids in an ING2 protein 
having the sequence of SEQ ID N0:8 that may be substituted, and reasonably expect that 
these substitutions would have no significant effect on ING2 function. 

Finally, the Appellants note that the specification provides working examples of five 
ING2 isoforms, ING2A, ING2B, ING2C, ING2D and ING2E (corresponding to SEQ ID 
NOS: 2, 4, 6, 8 and 10, respectively) are described in the instant appUcation. ING2A, ING2B, 
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ING2C and ING2E share 87.6%, 94.2%, 94.2% and 81.8% amino acid sequence identify to 
ING2D (corresponding to SEQ ID N0:8), respectively. Considering each of ING2A, ING2B, 
ING2C, ING2D and ING2E bind lAP and can induce p53 expression (see Fig. 12 of the 
instant specification), a skilled person would reasonably expect ING2 variants with at least 
95% identity (i.e., significantly greater than 81.8% identity) to SEQ ID N0:8 to have an 
activity similar to that of SEQ ED N0:8. 

In summary, the specification provides adequate evidence that one can substantially 
modify the amino acid sequence of an ING2 protein, without loss of function. Since DSfGl 
(which has as little as 81.8% identity to an ING2 protein) has a function similar to that of 
ING2, a skilled person would recognize that ING proteins with at least 95% sequence identity 
to an ING2 protein could be made and used without undue experimentation. 

In view of the foregoing discussion, the Appellants submit that the instant 
specification provides sufficient guidance as to which amino acids could be changed in an 
ING2 variant having the sequence of SEQ ID NO: 8 in order for it to remain functional. As 
such, the Applicants believe that no undue experimentation is required to practice the subject 
matter of claim 17, and this rejection should be reversed. 

Reversal of this respectfully requested. 

Summary 

The Appellants submit that all current rejections lack merit. 
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Relief Requested 

The Appellants respectfully request that all of the rejections be reversed, and that the 
application be remanded to the Examiner with instructions to issue a Notice of Allowance. 
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Claims Appendix 

1-25. (Cancelled) 

26. A recombinant ING2 protein encoded by the contiguous polynucleotide sequence of 
nucleotides 120-845 of the nucleic acid set forth in SEQ ID N0:7. 

27. A recombinant ING2 protein, comprising an amino acid sequence having at least 95% 
sequence identity to the contiguous sequence set forth in SEQ ID NO: 8, wherein said 
recombinant ING2 protein increases activity of a promoter having a p53 binding site when 
introduced into a mammalian cell. 

28. (Cancelled) 

29. A recombinant ING2 protein, consisting essentially of the contiguous amino acid 
sequence set forth SEQ ID N0:8. 

30. The recombinant ING2 protein of claim 29, wherein said ING2 protein increases 
activity of a promoter having a p53 binding site when introduced into a mammalian cell. 

31. (Cancelled) 

32. The isolated protein of claim 27, wherein said isolated protein has the contiguous 
amino acid sequence set forth in SEQ ID N0:8. 
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AttyDkt. No.: RIGL-008CIP 
USSN: 09/715,725 

Evidence Appendix 

No evidence submitted under 37 CFR §§ 1.130 or 1.131 has been relied upon by 
Appellants in this Appeal. 

Exhibits A and B are appended hereto. 

Exhibit A: Example 14 of the "Synopsis of Apphcation of Written Description 
Guidelines", as published to the world wide website of the U.S.P.T.O. on March 1^ 2000. 

Exhibit B: Zeremski et al. (J. Biol. Chem 274:32172-32181, 1999). This reference is 
of record. 



Atty Dkt. No.: RIGL-008CIP 
USSN: 09/715.725 



Related Proceedings Appendix 
There are no decisions rendered by a court or the Board which would directly affect or be 
directly affected by, or have a bearing on the Board's decision in the instant appeal. 
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Example 14: Product bv Function 

Specification: The specification exemplifies a protein isolated from liver 
that catalyzes the reaction of A — ► B. The isolated protein was sequenced 
and was determined to have the sequence as set forth in SEQ ID NO: 3. The 
specification also contemplates but does not exemplify variants of the 
protein wherein the variant can have any or all of the foUowing: 
substitutions, deletions, insertions and additions. The specification mdicates 
that procedures for making proteins with substitutions, deletions, insertions 
and additions is routine in the art and provides an assay for detecting the 
catalytic activity of the protein. 



A protein having SEQ ID NO: 3 and variants thereof that are at least 95% 
identical to SEQ ID NO: 3 and catalyze the reaction of A ^ B. 



Analysis: 

A review of the fiill content of the specification indicates that a 
protein having SEQ ID NO: 3 or variants having 95% identity to SEQ ID 
NO: 3 and having catalytic activity are essential to the operation of the 
claimed invention. The procedures for making variants of SEQ ID NO: 3 
are conventional in the art and an assay is described which will identify 
other proteins having the claimed catalytic activity. Moreover, procedures 
for making variants of SEQ ID NO: 3 which have 95% identity to SEQ ID 
NO: 3 and retain its activity are conventional in the art. 



Claim: 
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A review of the claim indicates that variants of SEQ ID NO: 3 include 
but are not limited to those variants of SEQ ID NO: 3 with substitutions, 
deletions, insertions and additions; but all variants must possess the specified 
catalytic activity and must have at least 95% identity to the SEQ ID NO: 3. 
Additionally, the claim is drawn to a protein which comprises SEQ ID NO: 
3 or a variant thereof that has 95% identity to SEQ ID NO: 3. In other 
words, the protein claimed may be larger than SEQ ID NO: 3 or its variant 
with 95% identity to SEQ ID NO: 3. It should be noted that "having" is 
open language, equivalent to "comprising". 

The claim has two different generic embodiments, the first being a 
protein which comprises SEQ ID NO: 3 and the second being variants of 
SEQ ID NO: 3. There is a single species disclosed, that species being SEQ 
ID NO: 3. 

A search of the prior art indicates that SEQ ID NO: 3 is novel and 
unobvious. 

There is actual reduction to practice of the single disclosed species. 
The specification indicates that the genus of proteins that must be variants of 
SEQ ID NO: 3 does not have substantial variation since all of the variants 
must possess the specified catalytic activity and must have at least 95% 
identity to the reference sequence, SEQ ID NO: 3. The single species 
disclosed is representative of the genus because all members have at least 
95% structural identity with the reference compound and because of the 
presence of an assay which applicant provided for identifying all of the at 
least 95% identical variants of SEQ ID NO: 3 which are capable of the 
specified catalytic activity. One of skill in the art would conclude that 
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applicant was in possession of the necessary common attributes possessed 
by the members of the genus. 

Conclusion: The disclosure meets the requirements of 35 USC §112 first 
paragraph as providing adequate written description for the claimed 
invention. 
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The human INGl gene encodes nuclear protein 
p33^^^, previously shown to cooperate with p53 in cell 
growth control (Garkavtsev, L, Grigorian, I. A., Oss- 
ovskaya, V. S., Chernov, M. V., Chumakov, P. M., and 
Gudkov, A- V. (1998) Nature 391, 295-298). p33^^^ be- 
longs to a small family of proteins from human, mouse, 
and yeast of approximately the same size that show 
significant similarity to one another within the C-termi- 
nal PHD finger domain and also contain an additional 
N-terminal region with subtle but reliably detectable 
sequence conservation. Mouse ingl is transcribed from 
three differently regulated promoters localized within a 
4-kilobase pair region of genomic DNA. The resulting 
transcripts share a long common region encoded by a 
common exon and differ in their 5'-exon sequences. Two 
transcripts are translated into the same protein of 185 
amino acids, the mouse equivalent of the human 
pgg/ATGj^ while the third transcript encodes a longer pro- 
tein that has 94 additional N-terminal amino acids. 
Overexpression of the longer protein interferes with the 
accumulation of p53 protein and activation of p53-re- 
sponsive promoters after DNA damage. Between the two 
products of ingl, only the longer one forms a complex 
with p53 detectable by immunoprecipitation. These re- 
sults indicate that a single gene, ingl, encodes both p53- 
suppressing and p53-activating proteins that are regu- 
lated by alternative promoters. 



The INGl gene was identified as a result of a functional 
screening of genes, the suppression of which is associated with 
neoplastic transformation (1). Inhibition of INGl expression by 
antisense RNA promotes anchorage-independent growth in 
mouse breast epithelial cells, increases the frequency of focus 
formation in NIH 3T3 cells, and prolongs the life span of diploid 
human fibroblasts in culture. INGl expression is up-regulated 
in senescent human fibroblasts (2), and ectopic expression of 
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INGl cDNA leads to Gj sirrest or promotes apoptosis in several 
cell types (3). The accumulated observations indicating INGl 
participation in the negative regulation of cell proliferation, 
control of cellular aging, and apoptosis have defined INGl as a 
candidate tumor suppressor gene. 

We have recently found that the biological effects of INGl 
and p53 are interrelated and require the activity of both genes. 
Neither of the two genes can, on its own, cause growth inhibi- 
tion when the other one is suppressed (4). Furthermore, acti- 
vation of transcription from \he p21/WAFl promoter, a key 
mechanism of p53 -mediated grov^rth control, depends on the 
expression of INGl . A physical association between p33^^^^ 
and p53 proteins is detected by immunoprecipitation. These 
results indicated that p33^^'^^ is a component of the p53 sig- 
naling pathway and that p33^^*^^ cooperates with p53 in neg- 
ative regulation of cell proliferation by modiilating p53-depend- 
ent transcriptional activation. 

Despite the apparent importance of INGl in the control of 
cell proliferation, our knowledge of expression, regulation, and 
function of this gene remains incomplete. Moreover, Gen- 
Bank™ contains two INGl mRNA sequences differing in their 
5 '-ends. The origin of these differences is unknown and re- 
quires explanation. So far, all of the information about the 
function of p33^^'^^ was obtained from in vitro experiments 
that involved ectopic expression of INGl cDNA or its suppres- 
sion by antisense RNA. The analysis of the structure of the 
INGl gene and its regulation in vivo are essential steps toward 
the understanding of its function and involvement in develop- 
mental and physiological processes. This is particularly impor- 
tant due to the cooperation between INGl and p53, which 
suggests that the fimctioning of the p53 signaling pathway 
could be dependent on the regulation of INGl expression. 

In the present work, we used the mouse ortholog of the 
human INGl gene for detailed structural and expression stud- 
ies, v^th the goal of subsequently utilizing it as a system for 
extensive genetic analysis, ingl was found to be a highly evo- 
lutionarily conserved gene with complex regulation, which in- 
volves generation of alternative transcripts initiated from dif- 
ferent promoters and translated into proteins that differ in 
structure and expression patterns. Moreover, these proteins 
have opposite effects on p53-regulated transcription, indicating 
that ingl encodes two products with the properties of a puta- 
tive tumor suppressor and a putative oncogene. 

MATERIALS AND METHODS 

Plasmids and Libraries — Retroviral vector pLXSN, used for the in- 
troduction of mouse and human INGl cDNA in NMuMG, 10(1), and 
ConA cells, was provided by A. Dusty Miller (5). Retroviral vector 
pLXIG was constructed on the basis of pLXSN vector by substituting 
SV40 promoter and neo sequences with the internal ribosome entry site 
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of encephalomyocarditis virus (6) followed by the enhanced green fluo- 
rescent protein sequence (7). This vector permits us to translate both 
the gene of interest (cloned upstream of the internal ribosome entry 
site) and enhanced green fluorescent protein from a single bicistronic 
mRNA transcribed from Moloney murine leukemia virus long terminal 
repeat promoter. A cDNA library from senescent mouse embryonic 
fibroblasts was constructed using Superscript system (Life Technolo- 
gies, Inc.) according to the manufacturer's protocol and cloned into the 
A phage A gt22A vector. Stratagene's 129SVJ mouse genomic library 
cloned into the A FIX II vector was used for the isolation of the mouse 
genomic, in^i -containing clones. 

Cell Lines — Cultures of mouse embryonic fibroblasts were obtained 
from 10-day-old embryos. 10(1) cell line, a derivative of Balb/c 3T3 cells 
that spontaneously deleted both p53 alleles (8), was kindly provided by 
Arnold Levine. Pseudonormal mouse mammary gland epithelial cell 
line (NMiiMG) was obtained from the ATCC collection. Ecotropic ret- 
roviral packaging cell line BOSC23 (9) was kindly provided by Warren 
Pear and David Baltimore (Massachusetts Institute of Technology). The 
ConA cell line, a derivative of BALB/c 3T3 cells 12-1 (8) with wild type 
p53, was described earUer (10). It carries the lacZ gene encoding ^-ga- 
lactosidase Escherichia coli under the control of the p53-dependent 
promoter and therefore allows monitoring p53 transcriptional activa- 
tion by a routine X-gal staining. AH cells were maintained in Dulbecco's 
modified Eagle's medium containing 107o fetal bovine serum supple- 
mented with penicillin and streptomycin. For the serum starvation 
experiments, 10(1) and NMuMG cells were kept with 0.5% fetal bovine 
serum for 48 h (NMuMG cells) or 36 h (10(1) cells). For the contact 
inhibition experiments, cells cultures were used 48 h (NMuMG) or 36 h 
(10(1)) after they became confluent NMuMG and 10(1) cells were 
irradiated with 10 grays of y-radiation and used 24 h after treatment. 
Mouse embryo fibroblasts were propagated until they reached senes- 
cence. Populations of ConA cells, expressing different levels of p37^^'^', 
were generated by transduction with pLXIG vector, carrying p37^^^^ 
cDNA, followed by fluorescence- activated cell sorting of cells with dif- 
ferent levels of fluorescence. 

Animals — Organs and embryos of FVB/N mice were used for the 
RNA isolation and preparation of whole body sections of embryos using 
a cryostatic microtome for histoblot hybridization. 

Hybridization Screening of cDNA and Genomic Libraries — cDNA 
and genomic library screenings were done according to the standard 
protocols (11) using ^^P-labeled human and mouse ingl cDNA probes, 
respectively. 

5'- and 3'-RACE^ — Alternative 5 '-ends of the mouse ingl were iso- 
lated from mouse spleen and mouse brain cDNAs using the Marathon- 
ready cDNA kit (CLONTECH), according to the protocol suggested by 
the manufacturer. API adaptor- specific sense primer, pro\'ided by 
CLONTECH, and the i/i^i-specific antisense primer (5'-CCATCT- 
GACTCACGATCTGGATCTTC-3') were used for PGR. Nested PGR was 
performed using AP2 adaptor-specific sense primer, provided by CLON- 
TECH, and ttt,^i-specific antisense primer (5'-CTGCGGATCA(XJGC- 
CCTCTGK>ATGC-3'). Precise determination of the 5'- and 3 '-ends of the 
mouse ingl transcripts was done usiug the Marathon-2 cDNA amplifi- 
cation Kit based on the new SMART PCR cDNA synthesis technology 
(CLONTECH). It is based on the recently identified ability of Moloney 
murine leukemia virus reverse transcriptase to add several nucleotides 
to the 3' terminus of first-strand cDNA during the reverse transcriptase 
reaction (12). Briefly, when reverse transcriptase reaches the 5 '-end of 
the mRNA, it switches templates and continues synthesizing the 
SMART template-switching oligonucleotide. The resulting single- 
stranded cDNA contains the complete 5 '-end of the mRNA as well as 
the sequence complementary to the template-switching oligonucleotide 
and is then selectively ampUfied by PCR. In these experiments, poly(A) 
RNA preparations isolated from the th3Tnus and testis were used. The 
following sequences were used for the synthesis of antisense ingl- 
specific primers: 5'-A<^TGTGGTCKX>ATC(XX;AAC(jC-3' (for isoform 
la), 5'-CGCGG0GAGCCAGAGCAGAGAAGGT-3' (isoform Ic), and 5'- 
GGCGTGGCCTGTCATTGTCGCTG-3' (isoform lb), in^i-specific sense 
primer 5'-GCGTGCTTCTTGCTACCAT-3' was used for the PCR ampU- 
fication of the 3 '-end. 

Sequence Analysis — Sequencing was done using a Sequenase version 
2.0 DNA sequencing kit (U.S. Biochemical Corp.) or done by the Uni- 
versity of Chicago Cancer Research Center DNA Sequencing Facility. 
In all cases, both strands were read using multiple vector-specific or 



^ The abbreviations used are: RACE, rapid amplification of cDNA 
ends; PCR, polymerase chain reaction; X-gal, 5-bromo-4-chloro-3-indo- 
lyl |3-D-galactopyranoside. 



gene-specific primers. Protein sequence data base searches were per- 
formed using the gapped BLASTP program and the PSI-BLAST pro- 
gram that iterates the search using profiles constructed from BLAST 
hits as queries for subsequent iterations (13). Multiple sequence align- 
ments were constructed using the Gibbs sampling option of the MA- 
CAW program (14, 15). GenBank*^^ accession numbers for the de- 
scribed sequences are AF177753-AF177757. 

Southern, Northern, and Western analyses were done according to 
standard protocols. IgGl mouse monoclonal antibody against human 
recombinant p33^^*^'' (16) was used for the detection of in^i -encoded 
proteins. Anti-p53 monoclonal mouse IgG antibody Ab-1 was obtained 
from Calbiochem. p21 rabbit polyclonal IgG antibody was obtained from 
Santa Cruz Biotechnology, Inc. (Santa Cruz, CA). 

Immunoprecopitation Ana/ysis— Immimoprecipitation of p53 was 
carried out using 1 ^g of DO-1 (Santa Cruz Biotechnology) and 1 /xg of 
Ab-1 (Calbiochem) antibodies per 0.5 ml of cell lysate. Cell lysate was 
obtained by transient transfection of the corresponding vectors by Li- 
pofectAMINE Plus (Life Technologies, Inc.) into p53-negative Saos-2 
cells. Each transfected plate of cells was lysed in 0.5 ml of modified 
radioimmune precipitation buffer (25 mM Tris, pH 7.4, 125 mM NaCl, 
1% Nonidet P-40, 0.5% sodium deoxycholate with protease inhibitor). 
Sepharose-protein A (Amersham Pharmacia Biotech) was used to pull 
out the antibody-protein complexes, and pellets were washed four times 
with modified radioimmime precipitation buffer with 0.5% Nonidet 
P-40. Pellets were boiled in SDS-loading dye and run on 10% SDS- 
polyacryl amide gel electrophoresis. Western blot was carried out with 
anti-HA-biotin antibodies, with clone 12CAS (Roche Molecular Bio- 
chemicals), or with anti-p53 biotinylated antibodies (Roche Molecular 
Biochemicals) according to standard protocols. 

In Situ Hybridization on Nitrocellulose C Histoblot tingV — Whole 
body sections, 20 ptm thick, were prepared from frozen embryos embed- 
ded in blocks of Tissue-Tek O.C.T. and stored at -70 'C. Sections were 
placed on nitrocellulose (Schleicher & Schuell) to prepare histoblots as 
described previously (17). Histoblots were hybridized with ^^P-labeled 
RNA probes synthesized using Ambion's MAXI script in vitro transcrip- 
tion kit. m^i -specific sense and antisense probes were synthesized on 
the pBLUESCRIPT plasmid with the fragment of ingl cDNA corre- 
sponding to the PHD finger domain using T3 polymerase for the anti- 
sense and T7 polymerase for the sense probe. /3-Actin antisense RNA 
was synthesized using the template provided by Ambion. Histoblots 
were incubated in prehybridization solution (0.75 mg/ml yeast tRNA, 
0.75 mg/ml poly(A), 50% formamide, 0.3 m Tris, pH 8.0, 1 mM EDTA, 5X 
Denhardt's solution, 10% dextran sulfate, 10 mM dithiothreitol) at 42 "^C 
for 1-4 h. Hybridization was carried out for 12-24 h at 42 '^C in the 
same solution containing 1-5 x 10"^ cpm of probe/ml. After hybridiza- 
tion, histoblots were washed in 4X SSC at room temperature for 20 min, 
in 2X SSC, 0.04 ^-g/ml RNase A at 37 ''C for 30 min, in 2x SSC at 37 "C 
for 30 min, in Ix SSC at 60 "C for 15-30 min, and finally in 0.1 x SSC 
at 60 "C for 15-30 min and exposed to x-ray film for 2-7 days. 

RESULTS 

Multiple Transcripts of the Mouse ingl Gene Differ in Their 
5' 'End Sequences — In order to isolate the mouse ortholog of the 
INGl gene, we screened a cDNA library prepared from senes- 
cent mouse embryonic fibroblasts, using human INGl as a 
probe. The choice of the library was determined by the fact that 
INGl is expressed at higher levels in senescent, compared with 
normal, human fibroblasts (2). As a result, several clones were 
isolated, and the two longest were sequenced. The clones were 
identical to each other and highly similar to human INGl 
through most of their length, except for the 5 '-ends, which were 
different and not homologous to the human gene (Fig. LA). This 
observation could be an indication of alternative splicing of 
mouse ingl; however, it could also be potentially explained by 
cloning artifacts that occurred during the cDNA library prep- 
aration. To determine the structure of 5 '-end sequences of ingl, 
we used a 5 '-RACE technique for the isolation of cDNA se- 
quences corresponding to the 5 '-termini of ingl mRNA. cDNA 
was synthesized from mouse spleen and brain mRNA and 
ligated to synthetic adaptors. The cDNA was amplified by PCR 
using a sense primer specific for the adaptor and an antisense 
primer specific for the common part of ingl located close to the 
divergent region. Two fi-agments of different size were subse- 
quently cloned, sequenced, and compared with the clones iso- 
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Fig. 1. Cloning of different variants of mouse ingl cDNA and identification of their corresponding mRNA transcripts. A, schematic 
alignment of the mouse ingl cDNA clones and 5 '-RACE products with human INGl cDNA. la and lb represent mouse ingl cDNAs isolated from 
the library of senescent mouse embryonic fibroblasts. A 5 '-RACE reaction gave two products, one identical to lb and another shown as lc. The 
position of the gene-specific PCR primers used for the RACE reaction is indicated. Mouse ingl sequences are aligned with human INGl cDNA. The 
open reading frame for the human INGl clone is indicated. B, analysis of ingl transcription by Northern hybridization, ingl expression in mouse 
liver, heart, and testis was analyzed using a multiple -tissue Northern blot (CLONTECH Laboratories), which was hybridized with the probe 
corresponding to the common part of ingl {lane 1), the 5'-end of isoform la (lane 2), the 5'-end of isoform lb (lane 3), or the 5'-end of isoform lc 
(lane 4). Probes were obtained by PCR using -specific primers. An arrow shows the position of the RNA marker, 2.37 kilobases in size. 



lated from the cDNA library. The results of this comparison are 
schematically presented in Fig. lA, One of the 5'-RACE prod- 
ucts was identical to one of the previously isolated cDNA 
clones. Another product revealed the third variant of ingl 
cDNA, which again consisted of common and unique regions, 
with the junction located at exactly the same position as in the 
other sequences. Fig. lA shows schematic alignment of the 
isolated mouse ingl cDNA clones and the 5 '-RACE products. 
All variants are identical to each other (and homologous to 
human INGl) up to the same nucleotide and differ only in their 
5 '-ends. 

In order to verify that the observed cDNA structure reflects 
naturally existing transcripts, we analyzed ingl mRNA species 
by Northern hybridization using probes that represent common 
or specific regions of the isolated cDNAs. As shown in Fig. LB, 
the probe for the common region revealed multiple transcripts 
in RNA isolated from mouse liver, heart, and testis. Probes 
specific for individual transcripts, however, showed more sim- 
ple hybridization patterns, which in combination covered the 
whole set of transcripts found by hybridization with the com- 
mon probe. These results indicated that the cloned sequences 
represent the majority of multiple transcripts of ingl synthe- 
sized in the thymus and spleen. 

Mapping of Coding Regions of ingl in Mouse Genomic 
DMA — Comparison of sequences of individual cDNA clones 
showed differences in their 5' regions, suggesting that ijigl has 
multiple alternative 5'-exons (Fig. 1). To verify this hypothesis, 
we determined the structure of the mouse ingl gene. We iso- 
lated phage clones carrying sequences homologous to the 
mouse ingl cDNA by hybridization screening of a mouse 
genomic library. These clones were mapped by restriction di- 
gestion analysis, in combination with Southern blot hybridiza- 
tion, with the probes corresponding to the different ingl parts. 
The interpretation of the results obtained is shown in Fig. 2A. 

Comparison of genomic and cDNA sequences of ingl using 
PCR with different m^i -specific primers (data not shown) as 
well as with the sequencing data showed that most of the 
transcribed sequences of the ingl gene come from a single exon. 
Alternative 5 '-ends are encoded by different exons positioned 
upstream from the common exon. Comparison of the sequences 
of cDNA and genomic clones, as well as 5'-RACE products, 



revealed three isoforms of mouse ingl that differ from each 
other only in their 5 '-ends, which also indicates that each 
isoform is most probably expressed from its own promoter. 
Southern blot hybridization analysis of in^i -related sequences 
in the mouse genome indicated that ingl is a single gene with 
no obvious close family members (data not shown). 

The length of the isolated cDNA clones appeared to be sig- 
nificantly shorter than that of the mRNA species detected by 
Northern hybridization (Fig. LB), suggesting that part of the 
transcribed sequences were missing from the isolated cDNAs. 
To determine the exact start sites of ingl transcription, we 
used a new procedure called "SMAET-based 5 '-RACE" as de- 
scribed under "Materials and Methods." Using this method, we 
were able to extend the cDNA sequences for the 5 '-end of 
isoform lb. This exon contains an extremely G/C-rich region 
that blocked DNA elongation during the original 5 '-RACE re- 
action (Figs. 2B and 3) and complicated sequencing of the final, 
long 5 '-RACE product. Therefore, the transcription start site of 
isoform lb was not precisely identified, since it was estimated 
based on the size of the 5 '-RACE product and the analysis of 
the genomic sequence. 

Comparing the 3 '-ends of hximan and mouse INGl cDNAs 
showed that although the mouse transcript was flanked by a 
poly(A) stretch, its 3'-imtranslated region was significantly 
shorter than the human one. Analysis of the genomic clone 
with the mouse ingl sequence revealed the presence of a long 
poly(A) stretch that could potentially be used as a primer- 
binding sequence for reverse transcription initiated from oli- 
go(dT) primers (Fig. 3C). Moreover, the alignment with the 
3 '-end of the human INGl cDNA sequence resumes down- 
stream from this genomic poly(A) stretch. The exact end of ingl 
transcription in mice was determined using the same method 
applied for the generation of 5 '-sequences. Mouse ingl cDNA 
was, in fact, found to be longer than was originally thought 
(Figs. 2B and 3). Sequence comparison showed that mouse and 
human INGl transcripts terminate at the same point and 
share significant levels of similarity up to the very 3 '-ends. 

Sequence Analysis of ingl — Nucleotide and predicted amino 
acid sequences of the three mouse ingl cDNA isoforms and the 
promoter regions for all three alternative transcripts are shown 
in Fig. 3. It was found that ingl isoforms la and lc have several 
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Fig. 2. Mapping of mRNA-coding 
regions within the mouse ingl locus. 

A, map of the mouse genomic region con- 
taining ingl cDNA-related sequences. 
Three i^i -positive genomic clones were 
isolated from the mouse genomic library 
using ingl cDNA as a probe (G2, G9, and 
GIO), and analyzed by restriction diiges- 
tion and Southern blot hybridization. 
This analysis, in combination with the 
PGR data and with partial sequencing of 
the genomic clones, allowed us to deter- 
mine the positions of the ingl exons. S, 
determination of transcription initiation 
and termination sites in the ingl gene. 
The upper panel shows the genomic map 
with the positions of the ingl exons and 
the structure of ingl cDNA clones as de- 
termined after cDNA Hbrary screening 
and 5 '-RACE analysis. The structure of 
two known human INGl cDNA isoforms 
in relation to mouse ingl sequences is 
shown above the genomic DNA. The lower 
panel demonstrates the final structure of 
the ir^l gene determined as a result of 
precise mapping of transcription initia- 
tion and termination sites. In the upper 
paTiely the position of the G/C-rich region 
in the 5 '-end of clone lb and the poly(A) 
region in the 3' part of the ingl tran- 
scripts are indicated, which interfered 
with the polymerase chain reaction dur- 
ing the original 5 '-RACE. 
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tightly clustered transcription start sites. Sequences upstream 
of the initiation sites (putative promoters) lack TATA boxes, 
but they do contain a sequence corresponding to the loose 
initiator consensus PyPyAN(T/A)PyPy that includes the tran- 
scription start sites (Fig. 3). In both cases, areas upstream from 
the initiator are extremely GC-rich with multiple Spl-binding 
sites. All of these features are typical of TATA-less promoters 
(18, 19). Binding sites of inducible factors that are usually 
present further upstream in the promoter area {e.g. CAAT box, 
Oct-1 and Oct-2 binding site, NF-kB, or ATF binding site) were 
not found in these promoters. Transcription of ingl isoform lb 
is estimated to start approximately 30 nucleotides downstream 
from the TATA-like box. A putative CAAT box is present 60 
nucleotides upstream from the TATA-like sequence. Since the 
transcription initiation sites for isoforms la and lb are only 
about 200 base pairs apart, there is a possibility that their 
promoters share some of the regulatory sequences including, 
for example, a number of Spl binding sites located upstream of 
this area. 

All three alternative transcripts of mouse ingl contain the 
same long open reading frame, although the sizes of the pre- 
dicted proteins are different. While isoform lb encodes a pro- 
tein of 279 amino acids, the other two isoforms are predicted to 
encode a shorter protein product of 185 amino acids, which 
lacks 94 N-terminal amino acids. Translation of these two 
products is expected to start from the initiation codon, which is 
located at the beginning of the common exon. Alignment of the 
predicted amino acid sequences with that of human p33^^^^ 
(GenBank™ accession number AF044076) revealed high sim- 
ilarity between the mouse and human proteins (89% sequence 
identity) (Fig. 4). 

The nonredundant protein sequence data base at NCBI was 
searched using the in^i -encoded protein sequence as a query, 



and a highly significant sequence similarity was detected with 
a human paralog of p33^^^^. These included three uncharac- 
terized proteins from the budding yeast Saccharomyces cerevi- 
siae and their homologue from fission yeast Schizosaccharomy- 
ces pombe (probability of the similarity being observed by 
chance <10"^^). All of these proteins are approximately the 
same size and contain a C-terminal PHD finger domain (20, 
21). The sequence conservation in the PHD domain in these six 
protein sequences is striking. There are 13 invariant residues 
in addition to the eight metal-chelating cysteines and histi- 
dines that are conserved in all PHD fingers (Fig. 4B). Multiple 
alignment analysis resulted in the delineation of an additional 
N-terminal region that is conserved in these proteins (Fig. 4B). 
This region consists of approximately 100 amino acid residues, 
includes two distinct conserved motifs, and shows a fairly sub- 
tle similarity that was not statistically significant in the con- 
text of the screening of the complete data base, except for the 
conservation between the two hiiman paralogs. However, in the 
reduced search space defined by the presence of the PHD fin- 
ger, it was shown that the probability of finding this level of 
similarity by chance was less than 10"^^ for the distal motif 
and 10"^ for the proximal motif. 

Expression of ingl Transcripts in Adult and Embryonic 
Mouse Tissues — We analyzed the expression of ingl mRNA in 
the organs of adult mice and mouse embryos at different stages 
of development by Northern blot hybridization. The probe rep- 
resenting the common exon of the ingl gene revealed multiple 
mRNA size classes that represent alternative transcripts of 
ingl (Fig. 5A). Overall ingl mRNA expression is most abun- 
dant in thymus and testis. Much lower levels were detected in 
the rest of the tissues tested, and they also differed in the 
content and relative intensity of the hybridizing bands. The 
same pattern of ingl miRNA expression was observed in p53- 
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CTiCAGTCCCA GCCATACAAA GGCGCGAATG TGGCGACCAC AGGCCAGGCT OCCOCCCACC 

GCStTTGCCTA GGCAGGAAGA (aKKGSGflCC GCCGGGGCCC GTCGCCTAGG CAACCCCTCG 

CTCTGCCCGG GCTTTTCAGA AAGCCCTAAA -J'CCGGCAGCT GTGSGGGAAG SGCGGGA'rGG 

GAAGCTGAAG GT'J'GTGC'I'CC CCTGCGGGAA GCACACTGCG GTGCCCGCtG CCAACTCAGT 

gtgacgagga catgggcagc ctcagatgcg gcagggaggg gaagagctag ttaggatcag 

agaagagctg ctcccgccag cccgcccacc gtagatctcc gcgctgaaaa gagcacccct 

cgggggcccc tttgtctcca ggccattcta aacccagcac cgggaggcga cacaaaggga 

cgatggcccc ccaaggcgca ggcgcgtgga ggcgccgagg atgctgggag tggtggtccc 

TTTGCGTGTC GATAGCTGCC CTCCCGCGCG GGTCCGCATC CATCGCCAGG GAGCTGCGCC 

TCCCCCTC£A_^l£CCCCTC GCTCCCCTCC CCCCCCTCCT CCCTCCTCCA ACAGCACACC 

TCTCTAGTAC TAGGCGTGCG GGGGGCGTTG CCGATCCCAC CACACCTCCT TCTCGTCCAff 

JUTCCeC GCA GAGGITAGAG TCGCAAGGGG ATTGGCTGAC GCCGTTGCCA rTTCTGTTCG 

gxnn'* ■ ' ' — ' iittrvn ? | ^ 

GGGGCCCTAT TTATACCGCG CTAGGAGGCG GGGGAA7GCA GTTAACCCAG CGGAGGGTGG 

CTCTTCTCAG GTTAGTCGGT TTGAAGGCAG CGA7CTGGAG GCTCG7CCGG CCCCGGCGTG 

TCTCGAGGAG ACAGGCTAGC ACCCCGCGGT TCCCGGCCGG CGTGCGGCTC CGCCCAGTAG 

GTCCGCCACC GAGTGTTAAA CTCCTAGTAA AGT7TCGCGT CGCCGGTCTC CCCTGCCCAG 

GCGACCCCGC CGCTGTCCCC TCGGCGACCC TGGCCCCCAG CGACAATGAC AGGCCACGCC 

CCCGCGCGGG CCGGGCTCGC GGCCCGCCGC CCCCCGGCCC GGGRCGGTGA GGGGCGTGAA 

TGCGGTGGGG GCGGGGCCGC CTCCGGGAGG AGGGGTGGCG GAGCGCATGC GCGCTGCGCG 

CGGGGCTGAA rGTTTCCCAA GTGTTTGAAA CTGGTA7TTG GGTTT7CCAC GTTGGACAAG 

TGCGGCGAGG CGGGCGGCGG AGCGCGCCCT TCCCGCAGCC GGCCCCGCTC TCTCCGCTCT 

CCACGCTGGT GGCGGTGTGG GCGGCGGTGG CCGGCCCTCT TTGGGTGTGT GCGCCCTAGT 

ACGCACGGCC CCCCGCGCCG CCCGCCGCCC GGGAGGGGGC CTG-ACGGCC GGCGGGGTGT 

GCGCTTGAGC TTCGGCCGCC GCGGCCCGCG CCTGCAGGCG CCGSGGTCCC CGCGGCCCCG 

GAGGCGGCGC 6GCAGGCGCC AGATGTAGCC GCCGGCCAGC CCGSCCGGAG CGGCGGGGGG 

GCGCGGAAGG GCGAGAGAGC TTTGCAT7TT GCAGTGCTGT TTGAGGGGGG CGGGGGGTGG 

AGGAAGCGGA AAGCCGCCGA ATCGCCGGGG ACC7CCGGGG TGAACCATGT TGAGTCCTGC 

CAACGGGGAG CAGATCCACC TGGTGAACTA TG'J'GGAGGAT TACCTGGACT CAATCGAGtC 

ACTGCCrrrC GaCCTGCAGA GGAACG'ICtC GCTGATGCGG GAGATCGACG CCAAATACCA 
AGGTACGGCG 
txon ^ 1 introR 



B 



ACAGAGTCAT tgggggqcgg gcagtatggc cccgogcttt gctcgggact 

TGTGTTTCCG CTGTCTGCTT ttTTTTTTTT TTTTTTCCTC TGGTAGAGGG 
AGAGGAGGTG AGTTGAT7TG AATGTCTTCG GGTCGCCCGT CCTCTGGCCT 
CCTGTCGCTG CTGCGGCGOG CCACCGJCGC TGGCTC-'GCG CGC'J'GATTGG 
GGTCTCCCCC CTCAGCGrCG TCGACTCACA TAGGCTTCGff GCGGGGCGGG 



n 

GCGCGCTTCC TGAGTCTCGT AGGCtGGAGT GGA-'CGCGGC^CXCJXZCCGG 
ATGGCGGCGG TGGCCTCCGG GAGGATGCTG CGC7TACCTT CTCTGCTCTG 
GGAGCCTCTG ATCGCTTGTC GCGTTTCCGG TAGGCTTGAA TGAGCGGGAG 
exon ^— ' intron 



CGCTCCAGTC ' 

CCGGGCG<S\G 

TGGGTTGGCT 

TGTACCTCCJ' 

GCAGGCCSSG 

► JC 

CTGCGAGGCT 
GCTCCCCGCG 
TGCCTCTGTC 



Exon la 



ACTTCCGCCTCGCTCGGCTCCGCCCCCTCGTCCCTCCTGGAAGAGGACRGCTCTCTAGTA €0 

CTAGGCGTGCGGGGGGCGTTGCCGATCCCACCACACCTCCTTCTCGTCCAGATTGGCCGC 120 

AGAG 124 

Exon lb 

GTTAACCCAGCGGAGGGTGGCTCTTCTCAGGTTAGTCGGTTTGAAGGCAGCGATCTGGAG 60 

GCTCGTCCGGCCCCGGCGTGTCTCGAGGAGACAGGCTAGCACCCCGCGGTTCCCGGCCGG 120 

CGTGCGGCTCCGCCCAGTAGGTCCGCCACCGAGTGTTAAACTCCTAGTAAAGTTTCGCGT 180 

CGCCGGTCTCCCCTGCCCAGGCGACCCCGCCGCTGTCCCCTCGGCGACCCTGGCCCCCAG 240 

CGACAATGAC AGGC CACG CCCCC GCGC GGGCCGGGCTCGCGGCC CGCCGCCC CCCGGCCC 300 

GGGACGGTGAGGGGCGTGAATGCGGTGGGGGCGGGGCCGCCTCCGGGAGGAGGGGTGGCG 360 

GAGCGCATGCG CGCTGCGCGCGGGGCTGAATGTTTC CCAAGTGTTTGAAACT GGTATTTG 420 

GGTTTTCCACGTTGGACAAGTGCGGCGAGGCGGGCGGCGGAGCGCGCCCTTCCCGCAGCC 480 

GGCCCCGCTCTCTCCGCTCTCCACGCTGGTGGCGGTGTGGGCGGCGGTGGCCGGCCCTCT 540 

TTGGGTGTGTGCGCCCTAGTACGCACGGCCCCCCGCGCCGCCCGCCGCCCGGGAGGGGGC 600 

CTGCACGGCCGGCGGGGrCTGCGCTTGAGCTTCGGCCGCCGCGGCCCGCGCCTGCAGGCG 660 

CCGGGGTCCCCGCGGCCCCGGAGGCGGCGCGGCAGGCGCCAGATGTAGCCGCCGGCCAGC 720 

CCGGCCGGAGCGGCGGGGGGGCGCGGAAGGGCGAGAGAGCTTTGCATTTTGCAGTGCTGT 780 

TTGAGGGGGGCGGGGCGTGGAGGAAGCGGAAAGCCGCCGAATCGCCGGCGACCTCCGGGG 8 40 

TGAACCATCTTGAGTCCTGCCAACGGGGAGCAGATCCACCTGGTGAACTATGTGGAGGAT 900 
MLSPAHGEQIHLVNyVED 

TACCTGGACTCAATCGAGTCACTGCCTTTCGACCTGCAGAGGAACGTCTCGCTGATGCGG 960 
YLDS IB8LPFDLQRKVSLMR 

GAGATCGACGCCAAATACCAA6 9B2 
E I D A K ¥ Q 



Bxon Ic 

CACTTTCCGGCTGCGAGGCTATGGCGGCGGTGGCCTCCGGGAGGATGCTGCGCTTACCTT 
CTCTGCTCTGGCTCCCCGCGGGAGCCTCTGATCGCTTGTCGCGTTTCCG 



60 
109 



AGATCCTGAAGGAGCTGGACGACTACTATGAGAAGTTCAAACGGGAGACAGACGGCACCC 60 
EILKELDDyXEKFKRETDGT 

AGAAGCGCCGGGTACTGCACTGCATCCAGAGGGCCCTGATCCGCAGCCAGGAGCTAGGCG 120 
QKRRVLHCIQRALIRSQELC 

ATGAGAAGATCCAGATCGTGAGTCAGATGGTGGAGCTGGTGGAGAACCGCAGCAGACAGG 180 
DEKIQIVSQMVBLVEHRSRQ 

TGGACAGTCACGTGGAGCTCTTCGAAGCACACCAGGACATCAGTGACGGCACTGGTGGCA 240 
VDSEVSLFEABQDISDGTGG 

GCGGCAAGGCGGGCCAGGACAAGTCGAAGAGTGAGGCCATCACACAGGCAGAT AAGCCGA 300 
SGKAGQDKSKSEAITQADKP 

ATAACAAGC GGTCCAGGAGGCAGCGAAACAATGAGAATCGAGAGAACGCGTCGAATAATC 360 
NNKRSRRQRNNEHRSNASHN 



ACGACCATGATGACATCACCTCAGGAACGCCCAAGGAGAAGAAAGCAAAAACCTCAAAGA 420 
BDBDDITSCTPKEKKIKTSK 

AGAAGAAACGCTCCAAGGCCAAAGCAGAGAGGGAAGCGTCTCCTGCCGACCTTCCCATCG 480 
KKKR8KAKAERSASPADLP1 

ACCCCAACGAGCCCACGTACTGTCTGTGCAACCAGGTCTCCTACGGGGAGArGATCGGCT 540 
DPNEPTYCLCNQVS YGEMIC 

GTGACAACGACGAATGCCCCATCGAGTGGTTCCACTTCTCCTGCGTGGGGCTCAACCATA 600 
CDNDECPIEWPHFSCVCLHH 

AACCAAAGGGCAAGTGGT ACTGCCCCAAGTGCCGTGGGGAGAGCGAGAAGACCATGGACA 6 60 
KPKGKWYCPKCRGBSEKTMD 

AAGCCCTGGAGAAGTCCAAGAAAGAGAGGGCTTACAACAGG rAgrGA GTGGACACTCACC 720 
KALEKSKKERAYHR 

CGTGGTCAGTGACACAGCCACCAGTGTGTTTATGGTATCGCTGCCTTCGTGGAAGTCCGA 780 
GGGCAGTCAGATGAGTATTTTAGAGAATGTTAGCCGTGCCTCCTCTCCTCGGATGGCGGA 840 
GACAGCCTGCGTCTTCATGGTGTACCTTGTTCCCAGACTGGGTCTGCAGAGCCGCAATTT 900 
AGAAACTACAAATACAGGTTCG AATTAACCATGTC AGCGAGTGTCAGACTGATTTTGGGG 9 60 
GTTGGGGAGTGACTTGGAAGTAAGCTAACAGCTGTATAAAGAAGAGATTTCATTTGGCTG 1020 
TTTTGACAAAAATGTTACGGTTTACAAAAAAAAAAAAAAAAATTAGCCAGGTTGCTGAAG 1080 
TACAACAAGCGTGCTTCTTGCTACCATAATGTATATCCACACGACAAGTTGGTGGTCTAA 1140 
AGTCTAAATATTATTATTTTTTAAAGAGGTAAATGGGTAAATTTTACATGACATATTTTA 1200 
TACATGGCCTATTCCCTAACTGGCCATTTTTAATGACTGGGTACATTTTTAATAGGTCAG 12 60 
ACAAAAGTGGTCCAGGCAGTGGGTCCGftGTCTTGCCTTTGCT ACCCT AGGTCATGGTGTA 1320 
GCCACCTTTAACTTATATGAAGTGTATAAATGTACATCTTGCCCCTGCTGTATCATAACC 1380 
GGAAGTGCAGCCTGGTGCTGTTATGAAGCCAAGGTGTGCATCCTGCTGCGTGTGTGAGCT 1440 
GTATAGATGTTGCGTCAAGAAATAAATGAAACTTGGCCAGTTTGTTTCTTTAGCATT7AA 1500 
TTTTAACATACAAATAATTTTAAATTTTTGTCTTAAAAATTTATACACCAGCAGTTTAGA 1560 
CAAAGCCTTAAGCAAATTGTATATTATTGATCTCACGTAATAAGGAAGTAGGCATTATGT 1620 
CCATGCCAGCAAATACATGTCAAACGTGTTTAGACCAGAGGTTAGGACGTGACTGTGCTT 1680 
GTGAGGTCCAGCCCCACCATCTGTTTTTGTATGGCCTGTGAGCTAAGAATGGTCTCTATG 1740 
TTTCTGAGTGGTTTTTAAAATCACATTTCATGATACACGAAAGTCCTATGAAATGAAGAT 1800 
TTGCGTGTCTATAAATAAAGACTCATTGGAACACC 1835 



Fig. 3. Sequences of the putative promoter areas of the ingl gene, ingl cDNAs, and their predicted protein products. A, sequence 
of the genomic region with the putative promoter areas of cDNA isoforms la and lb; the start sites of transcription of these two variants are marked 
hy arrows (the start site of isoform lb was estimated based on the size of the 5 '-RACE product and the analysis of genomic sequence). The initiator 
sequence of isoform la, including multiple initiation start sites, is underlined; TATA-like sequence for isoform lb is shown in boldface type. The 
underlined boldface italic sequence is the CAAT box positioned about 100 base pairs upstream from the lb transcript initiation site. Spl binding 
sites are displayed in boldface italic type, B, sequence of the putative promoter area of isoform Ic, with the underlined initiator sequence 
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mouse MLSPAKGEQIHI-WYVEDyiI)SIESLProLQRNVSIMREIDJUCYQEIIJ?EII)DyyEK^ 60 

human MLSPANGEQLHLVNYVEDYLDSIESLPr DLQRNVSLMREIDAKYQEILKELDECYERFSR 60 

mouse ETDGTQiatKVLHCIQRALIRSQEL(roEiaQrVS(^4VraLVENRSRQV^ 120(26) 

human ETDGAQKRRMLHCVQRALIRSOELGDEKIQIVSQMVELVENRTRQVbSHVELFEAQQELG 120 

mouse DGTGGS(aUU3QDKSKSEAITQAOKPNNKRSIUlQRNNENR£N2^Nl^HDDITSSTPKKK^ 180(86) 

human DTVGNSGKVGADRPNGDAVAQSDKPNSKRSRRQRNNENRENASSNHDHDDGASGTPKEKK 1 80 

mouse IKTSKKKKRSKAKAEREASPJU>I^IDPNEPTyCIX3igySYSmaGCDNDECPIEWF^ 240(146) 

human AKTSKKKKRSKAKAEREASPADLPIDPNEPTYCLCNQVSYGEMIGCDNDECPIEWFHFSC 24 0 

mouse VGLNEKPKSKWYCPKCRGESEKTMDKALEKSKKERAYKR 279(165) 

******************* ******************** 

human VGLNHKPKGKWYCPKCRGEKEKTMDKALEKSKKERAYNR 27 9 



B 



P33INGl_Hs_2e29208 
P3 3 INGlp_Hs_3 041855 
YNL097c_Sc_2131886 
yOR06ac_Sc_1420209 
YHRO90c_Sc_626641 
SPAC3G9_Sp_2706459 
Consensus 1 

Houft* IttoKlb) 

P33INGl_Hs_2829208 
P33INGlp_Hs_3041855 
YNL097c_Sc_2131886 
YOR064c_Sc_1420209 
YHR090c_Sc_626641 
SPAC3G9_Sp_2706459 
Consensusl 

YPL138c_SC_12 44777 

CllG6.3_Ce_1229037 

SPBC17Dll_Sp_3 650403 

Mi-2_Hs_I585696 

CHD3_Hs_2645433 

KIAA0333_Hs_2224607 

KIAA0215_Hs_l 504012 

F33E11.3_Ce_315849B 

ConsensuaZ 



22 lESLPFDLQRNVSLMREinXKyQElLKELDD 
22 lESLPFDLQRNVSLMREIOAKYQEILKELDE 
? lEOLPMDLRDRrTEMREMDLQVOKAMDQLEO 
17 LttKPPLATSRYLTLLHEIHAKCVHSMPNLNE 
20 LDHLPCELIRSLRLMQTIDLFKNEETDEPGME 
13 VSHLPSEFRYLLEEIGSNDLKLIEEKKKYEO 
15 LDMVPNETKHirDElSVKEVAVHDIWKRIQA 
hpphP h h -h p 

IVSY---OEBIII 
tVSY— GEMK 
fVSY---0] 

IVAT-— o: 

OPMVAHDMPi 

iRVsr- - -gehvaSdcpj 

iGSr— -GQMVJ 
TC bCpp sa O Hh»CDs 



13 KRRVLHCIQRALIRSQELGDEKIOIVSQKVELVESRSRQVDSHVEL 67 

13 KRRMLHCVQRALIRSOELGDEKIOIVSQMVELVEBIRTRQVDSHVEL 6? 
16 REE0MASlKrDTYKALEDAOEKV0LANQIYDLVDBHLRiaJ>QELAK 75 
16 QVRLLNKINKIYEfilJ1PSI.BEKMHVSSI«LDNLDRLTSRLIlAyEV 67 
29 ELEIOKSVTKNrNSSLENIlCSia.TI.ESPGAyKEPKLLLiaMLKKAK 41 

14 EDGLDKEIlCESLIJCCOSLQRElXVIJ\NTALFI.IARHLNKlJaCNIAL 73 
14 EDALYSTIREET0KAINI0MElCVOLADRARLGLTRHIiail.DDRLAK 7 3 

p bpp h p K>Kh bp p pfap 
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Fig. 4. Protein sequence analysis of p37*^^. A, alignment of amino acid sequences of mouse ingl lb isoform and its human ortholog. 
Numbers of amino acids are indicated; numbers in parenthesis indicate the size of the truncated protein product of la and Ic ingl isoforms. The 
underlined methionine in the mouse sequence indicates the beginning of the protein product encoded by isoforms la and Ic. The asterisks indicate 
identical amino acids, while dots indicate conserved changes in amino acid sequence. B, a multiple alignment of p33'^^^, its yeast homologs, and 
additional PHD finger-containing proteins. The aligned conserved blocks are separated by variable spacers w^hose lengths are indicated by 
numbers; for the N-terminal block, the distance to the protein N termini is indicated (the sequence of the human paralog of pSS"^^-* is incomplete). 
Consensusl shows amino acid conservation in the p33^^°^ family of proteins; Consensus2 shows the conservation in an expanded set of PHD finger 
proteins (in addition to the six proteins of the p33^^^^ family, the sequences that aligned with p33'^^^ with a probability of a random match below 
10"* in the first iteration of the PSI-BLAST analysis were included). Each consensus shows amino acid residues conserved in all sequences of the 
respective set; h indicates a hydrophobic residue, p indicates a polar residue, - indicates a negatively charged residue, s indicates a small residue, 
and a indicates an aromatic residue. The metal -chelating residues in the PHD finger domain are shown by white type on black. Each protein is 
identified by a gene name foUowed by species abbreviation and the gene identification number fi-om the nonredundant protein data base at the 
NCBI. p33'^^^p is the human paralog of p33'^^^ Mi-2 is a human autoantigen, and CHD3 is a chromatin-associated heUcase of the SNF2 family; 
the remaining proteins are uncharacterized gene products designated by their systematic gene neimes. Species abbreviations are as follows. Hs, 
Homo sapiens; Sc, Saccharomyces cerevisiae; Sp, Schizosaccharomyces pombe; Ce, Caenorhabditis elegans. 



deficient mice, which showed no direct effect by p53 on ingl 
regulation (data not shown). In embryos, the highest expres- 
sion was found on the 11th day of development and was char- 
acterized by changes in the relative expression of different 
classes of mRNA transcripts. 

Patterns of ingl expression in embryogenesis were also 
checked by in situ hybridization. We used the histoblotting 
technique, in which embryonic sections were fixed on a nitro- 
cellulose membrane and then hybridized with the RNA probes 
corresponding to the common part of the ingl cDNA. Besides 
sense and antisense m^i-specific RNA probes, we also used an 
antisense probe for ^-actin, a gene with ubiquitous expression, 
as an internal standard. Sections were prepared fi*om 10-, 12-, 
16-, and 18-day embryos. Results of in situ hybridization are 
shown in Fig. 5C. ingl is imiformly expressed in the whole 
mouse embryo at all stages of development examined. This is in 



agreement with the result obtained firom the adult mouse tis- 
sues, where ingl is expressed ubiquitously and at similar levels 
in all organs analyzed. However, in day 10 embryos, higher 
expression levels were observed in the yolk sac, while at day 16 
and 18 of development, higher levels of expression were de- 
tected in inner compartments of bones and probably match 
areas of ongoing ossification. 

In order to determine patterns of expression of different ingl 
transcripts in mouse tissues and during embryogenesis. North- 
em blots were hybridized with probes corresponding to the 
alternative start sites of ingl (Fig. 5, A and B). Isoforms lb and 
Ic were expressed ubiquitously in all adult mouse tissues 
tested, with the highest levels in thymus. Both isoforms were 
expressed in the embryos at all stages of development ana- 
lyzed, with the highest levels at day 7 in the case of isoform Ic 
or day 11 in the case of isoform lb. Out of all adult tissues 



overlapping the two transcription start sites. Multiple Spl binding sites are also indicated. C, sequences of ingl cDNAs and their predicted protein 
products. Sequences of the alternative la, lb, and Ic first exons as well as the common ingl exon are shown. The first ATG codon of isoform lb 
is underlined as well as the ATG codon in the common exon that is used as the initiation codon for la and Ic translation. Stop codons, indicating 
the end of translation, are also underlined. 
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Fig. 5. ingl expression in adult mouse tissues and embryos. A, results of Northern hybridization of total mouse RNAs isolated from the 
indicated organs with the indicated probes; a photograph of an ethidimn bromide-stained gel is shown as a loading control. B, Northern 
hybridization of RNA on a CLONTECH mouse embryonic multiple-tissue Northern blot CmRNA samples from mouse embryos at 7, 11, 15, and 17 
days of development) with the same probes. The arrows indicate positions of the 2.37-kilobase RNA marker as well as the positions of 28 and 18 
S rRNAs. C, ingl expression in mouse embryos analyzed by in situ histoblot hybridization. Histoblots were prepared from whole body sections of 
10-, 12-, 16-, and 18-day embryos and hybridized with the sense and antisense in^i -specific ^^P-labeled RNA probes corresponding to the region 
encoding PHD domain. Antisense probe for ubiquitously expressed actin gene was used as a control. 



analyzed, mRNA for isoform la was expressed only in testis 
and also in the 11 -day embryo. While there is no detectable 
signal with the la-specific probe in the 7-day embryo, traces of 
hybridization could be detected in mRNA firom day 15 and day 



17 embryos, which indicates an extremely low expression level 
at these points of mouse embryogeiiesis. 

Indications of Proliferation-dependent Regulation of ingl Ex- 
pression — To check whether ingl expression was proliferation- 
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Fig. 6. ingl mRNA and protein expression varies depending 
on cell growth conditions. A, RNA was isolated from dividing 10(1) 
{lane a) and NMuMG {lane e) cell lines, as well as from the dividing 
young mouse embryonic fibroblasts {lane h). RNA was also isolated 
from contact-inhibited 10(1) {lane b) and NMuMG cells {lane /); serum- 
starved 10(1) cells {lane c) (64 h at 0.5% FBS); y-irradiated 10(1) cells 
{lane d) and NMuMG cells Uaneg); and senescent MEFs {lane i). Filters 
were probed with total ingl probe. The bottom panels show RNA gels 
from which Northern blots were made. The arrows indicate positions of 
the 2.37-kilobase RNA marker {upper panels) or 28 and 18 S rRNAs. B, 
detection of in^i -encoded proteins by Western immimoblotting, using 
antibodies against p33^^^^. Cell lysates were prepared using radioim- 
mune precipitation buffer from the cells treated in the same way as 
explained above. As a control, cell lysates were prepared from 293 cells 
transfected with the plasmids expressing mouse lb ingl variant {lane a) 
or the human homologue of p31'"*^ that produces the protein product of 
the same size as mouse Ic and la variants. 

dependent, we analyzed RNA from two mouse cell lines, 
NMuMG and 10(1), at different growth conditions by Northern 
blot hybridization using total ingl cDNA as a probe, ijigl 
expression was also compared in senescent versus young, di- 
viding mouse embryonic fibroblasts. Results are shown in Fig. 
6A. In both cell lines, ingl was expressed at higher levels in 
dividing compared with quiescent cells (quiescence is induced 
by serum starvation, contact inhibition, or 7-irradiation). How- 
ever, in 10(1) cells the difference in expression was specific only 
for the upper ingl specific band, which corresponds to isoform 
lb. In mouse embryonic fibroblasts, ingl was expressed at very 
low levels without any differences between dividing and senes- 
cent cells. 

ingl expression in cell lines was also analyzed by Western 
blotting using an IgGl mouse monoclonal antibody produced 
against human recombinant p33^^^^ (16) (Fig. 6B). This anti- 
body has previously been shown to specifically detect the de- 
natm-ed form of mouse p33^''^*^^ protein in Western immunoblot 
protocols. Cell lysates were produced from dividing, contact- 
inhibited, serum-starved, or y-irradiated 10(1) and NMuMG 
cells. In both cell lines, two protein products were detected, 31 
and 37 kDa in size, which correspond to the truncated INGl 
and lb ingl protein products, respectively. The 37-kDa protein 
was present at the same levels in dividing and quiescent cells, 
while the 31 -kDa protein was present at higher levels in divid- 
ing compared with nondividing cells. The variation of expres- 
sion of ingl differs significantly at mRNA and protein levels, 
indicating that this gene could be regulated at the level of 
transcription or protein stability. 

Overexpression of pSf"^^ Inhibits p53 Function — All previ- 
ous functional analysis of INGl was done using human cDNA 
expressing the shorter protein encoded by this gene (1, 2, 4). As 
a result, p33^^^ was defined as a negative growth regulator 



that cooperates with p53 in transcriptional activation of p53- 
responsive genes. Its moxise homologue, p31^'^^, has a similar 
biological effect (data not shown) that is consistent with the 
high degree of identity between the mouse and the human 
proteins (see Fig. 4). To analyze the influence of the longer 
product of ingl on p53 function, we overexpressed cDNA for 
p37in^fj mouse cells maintaining wild type p53. To effectively 
monitor p53 -dependent transcriptional activation, we used pre- 
viously characterized ConA cells (see "Materials and Meth- 
ods"), carrying the p53-responsive bacterial /acZ gene. We 
transduced ConA cells with retrovirus expressing p37"*^^ and 
enhanced green fluorescent protein from a bicistronic mRNA 
and generated cell populations with different levels of overex- 
pression of p37'"^'^ by sorting cells differing in the levels of 
enhanced green fluorescent protein fluorescence. These cells 
were compared with ConA cells transduced with either the 
empty pLXIG vector or with retroviral vector expressing the 
short protein product of ingl. The expression of introduced 
constructs was confirmed by Western immunoblotting using an 
anti-p33^^*^^ antibody that recognizes both human and mouse 
proteins (16) (Fig. IB), p53-dependent expression of lacZ was 
determined by X-gal staining of ConA cells treated with UV 
light (Fig. 7A). The expression of p53 protein and p53-respon- 
sive p21"'''^^ protein in untreated and in UV-irradiated cells 
was estimated by Western immimoblotting (Fig. 7B), A de- 
creased intensity of the p21 band by Western blot and a weaker 
X-gal staining of ConA cells treated with UV light indicate that 
p37'"^^ clearly inhibited UV-induced accimiulation of both p53- 
inducible proteins. Moreover, overexpression of p37'"*"' 
strongly attenuated accumulation of p53 in response to UV 
light (Fig. 7B). These observations demonstrate that p37"^^ 
has a suppressive effect on p53-dependent transcriptional ac- 
tivation supposedly by attenuating p53 accumulation in re- 
sponse to DNA damage. This means that the two proteins 
encoded by the ingl gene have opposite effects on p53 function. 

It was previously shown that p33^^''^^ can be detected in cells 
in a complex with p53 by co-immimoprecipitation (4). Since at 
that time it was not known that INGl encodes two different 
proteins, it was unclear which of the two products of INGl was 
detected in these experiments. We therefore compared the abil- 
ity of short and long /iVGi-encoded proteins to form a complex 
with p53 by using immimoprecipitation. Since the protocol of 
immunoprecipitation was optimized for human proteins, we 
analyzed human variants of INGl proteins expressed in hu- 
man cells by transient transfection. p53-deficient Saos-2 cells 
were used in these experiments that have both alleles of the 
p53 gene deleted and presumably retain an intact p53 path- 
way. They were co-transfected in different combinations with 
the expression plasmids encoding wild type p53 and the short 
or long products of INGl. Cell lysates were treated with anti- 
p53 antibodies (see ''Materials and Methods"), and the presence 
of /A^G J -encoded proteins in precipitates was detected by West- 
ern immunoblotting (Fig. 7C). The results obtained showed 
that although p37^^^'^ is co-precipitated with p53, the shorter 
INGl product is not detectable in the p53 precipitate. This 
observation suggests that differences in the effect of the two 
INGl products on p53 fxmction could be a reflection of differ- 
ences in their interaction with p53 in the cell. 

DISCUSSION 

Multiple Products of the ingl Gene— INGl was originally 
described as a gene whose suppression promotes neoplastic 
transformation (1). Consistent with that, ectopic expression of 
the first isolated human cDNA was grovdih-suppressive for 
different cell lines. INGl was foimd to encode a nuclear protein 
termed p33^^^^ that was shown to functionally and physically 
interact with p53 in cell growth regulation (4, 22). Now, after 
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Fig. 7. p37'"*^ affects p53 function. A, dependence of p-galactosidase activation after UV irradiation of ConA cells on the expression levels of 
pgr^i^gj ConA cells expressing the human homologue of pSl^'*^ {lane a); p37*"*^ at low {lane c), medium {lane d), or high {lane e) levels; and ConA 
cells infected with pLXIG vector alone {lane b) were equally plated in a 12-well plate, UV-irradiated (25 J/m^), and X-gal-stained 16 h after the 
treatment. A fragment of the 12-well plate and microscopic views of individual wells are shown. B, expression of p53 protein, p53-responsive p21"'°^^ 
protein, and ingl protein products in untreated and in UV-irradiated ConA cells was estimated by Western immunoblotting. Analyzed cell lysates 
were isolated using radioimmune precipitation buffer from the cells treated in the same way as explained for A. C, coimmunoprecipitation of p53 
and INGl protein products in Saos-2 cells. Detection of p53 {upper panel), p37^^^^ {middle panel), and p31^^^^ {lower panel) proteins by Western 
immiinoblotting is shown. Lanes /", and i {top and middle panels), lysates from Saos-2 cells co-transfected with p53 and p3 7^^^-^ -expressing 
plasmids. Lanes f, g, and i {bottom panel), lysates from Saos-2 cells co-transfected with p53 and p31^-^^^-expressing plasmids. Lane h, lysate from 
control nontransfected cells. Lanes f and g contain the products of immunoprecipitation with anti-p53 antibodies {lane g) and control nonspecific 
antibodies {lane /). 



we characterized the structure and expression of the mouse 
ingl gene, it became clear that ingl regulation is more compli- 
cated than was originally thought, ingl is transcribed from at 
least three differently regulated promoters, and the resulting 
transcripts encode at least two different proteins. 

A similar type of regulation (alternative initiation leading to 
variability of 5'-exons) found in other tumor suppressor genes, 
namely BRCAl, APC, and INK4 (23-26), is associated with the 
generation of proteins with different functions (in the case of 
INK4, different reading frames are translated in the alterna- 
tive transcripts). Consistently, the two products of ingl have 
opposite effects on p53-dependent transcription regulation; one 
acts as a p53 cooperator, while the other acts as a p53 
suppressor. 

All of the transcripts share a common region encoded by a 
common exon but differ in their 5'-exons. Two of these alter- 
native exons do not contain protein-coding sequences (isoforms 
la and Ic), while the third one does (isoform lb). Consistently, 
one of the ingl transcripts encodes a 37-kDa protein {pS7^^^^), 
while two others are translated into a shorter protein of 24 kDa 
that surprisingly runs as if it was 31 kDa (pSl^'^^'^). Structures 
of mouse and human ingl genes are likely to be similar, con- 
sidering the high degree of evolutionary conservation of ingl 
sequences and the fact that the two versions of human INGl 
that are currently available in GenBank™ also share a large 
common part but have different N termini. It is noteworthy 
that one of the human variants of INGl (GenBank™ accession 
number AF001954) does not have a homologue with similar 
5 '-sequences among the identified mouse ingl isoforms, which 
raises the possibility that there could be more variants of 
mouse ingl transcripts that so far have not been isolated. 

pgg/^^Gi g^jj -^g homologs in such a distantly related species 



as yeast contain a remarkably conserved PHD finger domain 
and an additional, weakly conserved domain of unknown func- 
tion. PHD fingers have been shown to bind DNA (21, 27), but to 
our knowledge there is no evidence that they mediate protein- 
protein interactions. Therefore, it seems likely that the C- 
terminal PHD domain in the p33^^^ family of proteins is in- 
volved in specific DNA binding that may be important for 
transcription regulation. Given the outstanding conservation of 
this domain across the large phylogenetic distance that sepa- 
rates humans and yeast, the specificity of DNA binding with. 
respect to particiilar binding sites is expected to be conserved 
either in terms of DNA sequence, distinct features of chromatin 
structure, or both. It may be fiuther siirmised that the con- 
served N-terminal domain is involved in specific protein-pro- 
tein interactions that couple transcription regulation by 
pgg/A^cj homologs to other elements of cell cycle control. 

The conservation of domedn organization in p33^^*^^ and its 
yeast homologs is particularly notable given that yeast does not 
encode any homologs of p53. Thus, it appears that p33^^^^ is an 
ancient cell cycle regulator whose interaction with p53 is a 
later evolutionary addition. 

Regulation of ingl Expression— The results of expression 
analysis indicated that mouse ingl is a subject of regulation 
both at the protein and mRNA levels and that different iso- 
forms have different expression patterns. Thus, p37^^^^ is 
ubiquitously expressed in all tissues analyzed with elevated 
mRNA expression levels in the thymus, while levels of the 
^^-yiNCJ protein vary dramatically among organs and cell lines. 

Sizes of the ingl mRNA transcripts detected by Northern 
hybridization correlate well with the length of cDNA sequences 
isolated from all of the tissues except testis. Probes specific for 
each of the identified splice variants reveal multiple RNA 
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bands, indicating a variability of the ingl transcripts in this 
organ. It is likely that in testis ingl mRNAs are either termi- 
nated or processed differently from the rest of the tissues. 

There is a clear link between the proliferation rate of the cell 
and ingl expression both at the protein and mRNA levels. This 
observation may reflect cell cycle dependence of ingl transcrip- 
tion detected earlier for the human INGl gene; the biological 
significance of this regulation is not obvious so far. We failed to 
observe up-regulation of mouse ingl in senescent cells as it has 
been reported for human INGl (8), which suggests that regu- 
lation patterns of mouse and human ingl genes may not al- 
ways be the same. 

INGl and p53: Cooperators or Antagonists'^ — Human INGl 
has been known as a cooperator of p53 involved in negative 
growth regulation (1, 4), apoptosis (3), and senescence (2). This 
set of properties, together with the observations that show a 
decreased expression of INGl in several breast carcinoma cell 
lines and INGl gene rearrangement in one neuroblastoma cell 
line, allow INGl to be defined as a candidate tumor suppressor 
(1-4, 16, 22). Here we demonstrate that INGl encodes, in 
addition to the protein with the above described properties, 
another product that shows an opposite effect on p53 function, 
inhibiting p53-dependent transcriptional activation. Thus, a 
single INGl gene appeaired to simultaneously encode a candi- 
date tumor suppressor (p53 cooperator) and a putative onco- 
gene (p53 inhibitor). This means that a simplified view of the 
potential role of INGl in cancer should be revised. In the case 
of INGly we may have a new genetic mechanism of promoting 
cancer that involves an imbalance between the two products of 
one gene. Finther characterization of the biological activity of 
the long product of INGl, p37^^^\ indicated that it can affect 
the whole spectrum of cell properties known to be controlled by 
p53, including sensitivity to apoptosis, replica tive senescence, 
cooperation with dominant oncogenes, and drug response.^ 
Thus, overexpression of the longer product of INGl, which 
functionally acts as a p53 suppressor, could be the mechanism 
of attenuation of p53 activity in tumors that do not require 
mutations in p53 itself. 

A large scale analysis of clinical sEunples is needed to con- 
clude whether this hypothesis is correct or not. However, any 
attempts to investigate the potential cancer relevance of INGl 
should involve separate analysis of the alternative isoforms of 
this gene. It has already become obvious that a direct meas- 
iu*ement of the overall mRNA expression in tumors cannot be 
used as an approach to judge the involvement of INGl in 
carcinogenesis. The ratio between expression levels of the two 
INGl products seems to provide much more meaningful infor- 
mation. It is therefore essential to establish such assays that 
would distinguish between the two protein products or among 
the alternative INGl transcripts. 

Although the mechanism of p53 modulation by the two prod- 
ucts of the INGl gene remains unknown, we hope that the two 
new observations described in this work will shed light on this 
problem. We found that (i) p53 is found in a complex with the 
long but not with the short product of INGl and (ii) overex- 
pression of the longer product inhibits accumulation of p53 
protein after DNA damage. Both properties of p37^^*^^ clearly 
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resemble those of the natural p53 antagonist, Mdm2, which 
was shown to affect nuclear localization of p53 (28, 29) and to 
promote its proteasomal degradation (30, 31). All of these pos- 
sibilities should be tested for p37^^*^^ as well as a potential 
interdependence of the biological activity of these two proteins. 

Is the function of INGl limited to its interaction with the p53 
signaling pathway? It seems doubtful, considering the high 
evolutionary conservation of INGl. In fact, INGl homologues 
found in yeast (Fig. 4), which does not have detectable p53- 
related genes, suggest some important p53-independent role 
for INGl. The phenotypes associated with the knockout of 
INGl in yeast and mice may provide a lead toward its currently 
unknown cellular function. It is noteworthy that a gene knock- 
out approach will be complicated by the necessity to separately 
inactivate expression of each of the two products of INGl. 
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